Genotype-based matching to correct for population stratification in large-scale case-control genetic association studies.
نویسندگان
چکیده
Genome-wide association studies are helping to dissect the etiology of complex diseases. Although case-control association tests are generally more powerful than family-based association tests, population stratification can lead to spurious disease-marker association or mask a true association. Several methods have been proposed to match cases and controls prior to genotyping, using family information or epidemiological data, or using genotype data for a modest number of genetic markers. Here, we describe a genetic similarity score matching (GSM) method for efficient matched analysis of cases and controls in a genome-wide or large-scale candidate gene association study. GSM comprises three steps: (1) calculating similarity scores for pairs of individuals using the genotype data; (2) matching sets of cases and controls based on the similarity scores so that matched cases and controls have similar genetic background; and (3) using conditional logistic regression to perform association tests. Through computer simulation we show that GSM correctly controls false-positive rates and improves power to detect true disease predisposing variants. We compare GSM to genomic control using computer simulations, and find improved power using GSM. We suggest that initial matching of cases and controls prior to genotyping combined with careful re-matching after genotyping is a method of choice for genome-wide association studies.
منابع مشابه
استفاده از Propensity Score برای همسان سازی نمونه ها در یک مطالعه مورد شاهدی
Background and Aim: Case-Control studies provide evidence in the area of health. Validity and accuracy of such studies depend to a large extent on the similarity (similar distributions) of the case and control groups according to confounding variables. Matching is a method for controlling or eliminating the effects of important confounders. Matching using propensity score has recently been intr...
متن کاملDetecting Genetic Association in Case-control Studies Using Similarity-based Association Tests
Although traditional case-control studies may be subject to bias caused by population stratification, alternative methods that are robust to population stratification such as family-based association designs may be less powerful due to overmatching between cases and controls. Furthermore, case-control studies have the advantages of easy sample collection. Recently, several statistical methods h...
متن کاملMicroarray genotyping resource to determine population stratification in genetic association studies of complex disease.
We have developed a robust microarray genotyping chip that will help advance studies in genetic epidemiology. In population-based genetic association studies of complex disease, there could be hidden genetic substructure in the study populations, resulting in false-positive associations. Such population stratification may confound efforts to identify true associations between genotype/haplotype...
متن کاملDetecting association in a case-control study while correcting for population stratification.
Case-control studies are subject to the problem of population stratification, which can occur in ethnically mixed populations and can lead to significant associations being detected at loci that have nothing to do with disease. Here, we describe a way to measure and correct for stratification by genotyping a moderate number of unlinked genetic markers in the same set of cases and controls in wh...
متن کاملDelta-centralization fails to control for population stratification in genetic association studies.
OBJECTIVE To investigate the validity of simulations and assumptions used to underpin the delta-centralization (DC) method for correcting for population stratification in genetic association studies; to assess the effectiveness of DC compared to genomic control (GC) under valid simulation conditions; and to highlight other studies employing similarly flawed simulations. METHODS DC and GC use ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Genetic epidemiology
دوره 33 6 شماره
صفحات -
تاریخ انتشار 2009